Chunyi Li

GeoR-Bench: Evaluating Geoscience Visual Reasoning

May 12, 2026
Via arXiv

MirrorBench: Evaluating Self-centric Intelligence in MLLMs by Introducing a Mirror

Apr 16, 2026
Via arXiv

SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond

Mar 02, 2026
Via arXiv

STAR: Bridging Statistical and Agentic Reasoning for Large Model Performance Prediction

Feb 12, 2026
Via arXiv

Free-GVC: Towards Training-Free Extreme Generative Video Compression with Temporal Coherence

Feb 10, 2026
Via arXiv

Automated Safety Benchmarking: A Multi-agent Pipeline for LVLMs

Jan 27, 2026
Via arXiv

Embodied Image Compression

Dec 12, 2025
Via arXiv

Using GUI Agent for Electronic Design Automation

Dec 12, 2025
Via arXiv

GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models

Nov 17, 2025
Via arXiv

Data Assessment for Embodied Intelligence

Nov 12, 2025
Via arXiv